HMM-based TTS for hanoi vietnamese: issues in design and evaluation

نویسندگان

  • Thi Thu Trang Nguyen
  • Christophe d'Alessandro
  • Albert Rilliard
  • Do Dat Tran
چکیده

This paper presents the development and evaluation of an HMM-based TTS system for the modern Hanoi dialect of Northern Vietnamese, a tonal language. A study of specific phonetic and prosodic features of Hanoi Vietnamese is discussed. Consequences on the design of an HMM-based TTS system are derived. Using this knowledge, a TTS system, called VTed, is then developed under the Mary TTS platform. The second part of the paper is devoted to perceptual evaluations of Vietnamese speech synthesis. Three kinds of evaluations are considered necessary for quality assessment of this tonal language. The general MOS assessment, utterancelevel intelligibility, and tone-level intelligibility tests are conducted on the VTed system under a “natural speech reference” condition. The results show 1.21 points difference between natural and synthetic speech for the MOS test, a 0.2% – 0.9% difference for the utterance-level intelligibility test, 23% on average and – depending on the tone type – from 0% to 37% difference for the tone-level intelligibility test. These results demonstrate the need for more specific works on tonal/prosodic level to improve automatic synthesis of Vietnamese and other tonal languages.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

F0 parameterization of glottalized tones for HMM-based vietnamese TTS

A conventional HMM-based TTS system for Hanoi Vietnamese often suffers from the hoarse quality due to the incomplete F0 parameterization of glottalized tones. As estimating F0 in glottalization is rather problematic for usual F0 extractors, we propose a pitch marking algorithm where the pitch marks are propagated from regular regions of speech signal to glottalized one, from which the complete ...

متن کامل

Intonation issues in HMM-based speech synthesis for Vietnamese

In an HMM-based Text-To-Speech system, contextual features, including phonetic and prosodic factors have a significant influence to the spectrum, F0 and duration of the synthetic voice. This paper proposes prosodic features aiming at improving the naturalness of an HMM-based TTS system (VTed) for a tonal language, Vietnamese. The ToBI (Tones and Break Indices) features are used to learn two cru...

متن کامل

Wedelolactone from Vietnamese Eclipta prostrata (L.) L. protected zymosan-induced shock in mice

Wedelolactone is known to have biological activities such as anti-inflammation hepatitis, anti-hepatotoxic activity, and trypsin inhibitory effect. However, up to date, there has not been studied deeply in the role of wedelolactone for zymosan-induced signaling pathways in the process of regulating the excessive inflammatory responses in host. Here, we demonstrated that wedelolactone plays an e...

متن کامل

A hybrid TTS between unit selection and HMM-based TTS under limited data conditions

The intelligibility of HMM-based TTS can reach that of the original speech. However, HMM-based TTS is far from natural. On the contrary, unit selection TTS is the most-natural sounding TTS currently. However, its intelligibility and naturalness on segmental duration and timing are not stable. Additionally, unit selection needs to store a huge amount of data for concatenation. Recently, hybrid a...

متن کامل

Readiness, Availability and Utilization of Rural Vietnamese Health Facilities for Community Based Primary Care of Non-communicable Diseases: A Cross-Sectional Survey of 3 Provinces in Northern Vietnam

Background Vietnam’s network of commune health centers (CHCs) have historically managed acute infectious diseases and implemented national disease-specific vertical programs. Vietnam has undergone an epidemiological transition towards non-communicable diseases (NCDs). Limited data exist on Vietnamese CHC capacity to prevent, diagnose, and treat NCDs. In this paper, we assess NCD service r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013